We characterize a value of an observable by a 'sum rule' for generally non-commuting observables 
and a 'product rule' when restricted to a maximal commuting subalgebra of observables together 
with the requirement that the value is unity for the projection operator of the prepared state and the 
values are zero for the projection operators of the states which are orthogonal to the prepared state. 
The crucial requirement is that the expectation value and the variance of an observable should be 
independent of the way of measurement, i.e., the choice of the maximal commuting subalgebra of 
observables. We shall call the value a 'contextual value'. We show that the contextual value of an 
observable coincides with the weak value advocated by Aharonov and his colleagues by demanding 
the consistency of quantum mechanics with Kolmogorov's measure theory of probability. This also 
gives a derivation of Born's rule, which is one of the axioms of conventional quantum mechanics. 
I. INTRODUCTION 

In the conventional Copenhagen interpretation of quantum mechanics, the value of a physical quantity emerges only 
after measurement, not before. However, this idea becomes problematic when one considers quantum gravity: 
measurements presuppose spacetime, which emerges only after the quantum era of the Universe, and there are no 
observers who can make projective measurements of the Universe. Independently, Ozawa [?] has pursued the 
possibility of assigning a context-dependent value to a physical quantity before measurement, where the context 
describes outcomes of the measurements to be performed. 

On the other hand, Born's rule [?] is an axiom of conventional quantum mechanics, though there is literature which 
claims that the rule can be derived. It seems that these derivations lack either physical intuition, because no 
measurement process is considered, or mathematical rigor. We shall give a brief review of these claims in the 
discussion part of the present work, restricting our attention to the orthodox view and leaving aside the many 
world interpretation of quantum mechanics [?, ?]. 

In the present paper we show that the contextual value of an observable coincides with the weak value proposed 
by Aharonov and his colleagues [?, ?] on the basis of consistency of quantum mechanics with Kolmogorov's measure 
theory of probability [?] and that this consistency also leads to Born's rule as the unique probability measure of 
events to be discovered by the post-selection. 

The organization of the present paper is as follows. After a brief preliminary in Sect 2, we present in Sect 3 a 
theorem that the weak values are the contextual values of observables and give a derivation of Born's rule. The final 
section is a summary and discussion of our results and includes a brief review of the claims of the derivation of Born's 
rule. 



II. PRELIMINARY 



We first set up the general framework. Let $\mathcal{N}$ be a set of observables which act on the Hilbert space 
$\mathcal{H}$ of finite dimension, i.e., $\dim\mathcal{H} = N$. Let $V(\mathcal{N})_{\max}$ be the set of all maximal 
abelian subalgebras of $\mathcal{N}$ and choose a maximal abelian subalgebra $V_{\max} \in V(\mathcal{N})_{\max}$, 
noting that the choice is not unique. Then we can select a complete orthonormal system 
$\{|\omega\rangle\}_{\omega\in\Omega}$ as the set of all simultaneous eigenstates of the elements of $V_{\max}$, where 
$\Omega := \{\omega_1, \dots, \omega_N\}$. Hereafter, we identify the eigenspace of $|\omega\rangle$ with $\omega$. 
Therefore, $\Omega$ represents the way of orthogonal decomposition of the Hilbert space $\mathcal{H}$. We call the set 
$\{|\omega\rangle\}_{\omega\in\Omega}$ the 'context' induced by $V_{\max}$, which is chosen from 
$V(\mathcal{N})_{\max}$. In due course we will use Aharonov's formalism to interpret the complete orthonormal system 
$\{|\omega\rangle\}_{\omega\in\Omega}$ as the set of the states to be 'post-selected'. Thus, since we fix the prepared, 
i.e., pre-selected, state to be $|\psi\rangle$ throughout this paper, the 'context' is the context of the measurement 
to be performed. In the case of a finite dimensional Hilbert space, a different choice of 
$V_{\max} \in V(\mathcal{N})_{\max}$ amounts to a unitary transformation of the complete orthonormal system 
$\{|\omega\rangle\}_{\omega\in\Omega}$. For the infinite dimensional case, we have many types of von Neumann 
algebras [10, 11]. 

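We introduce the probability space $(\tilde\Omega, \tilde{\mathcal{F}}, \tilde P)$, adopting the standard notation of 
Kolmogorov's measure theory of probability, as follows. The sample space is 
$\tilde\Omega := \{\omega \in \Omega \mid \langle\omega|\psi\rangle \neq 0\}$, and the $\sigma$-field 
$\tilde{\mathcal{F}}$ over $\tilde\Omega$ is the power set of $\tilde\Omega$. $\tilde P$ is a probability measure over 
$\tilde{\mathcal{F}}$. The measure $\tilde P$ satisfies the relation 
$\tilde P(\{\omega_1\} \cup \{\omega_2\}) = \tilde P(\{\omega_1, \omega_2\}) = \tilde P(\{\omega_1\}) + \tilde P(\{\omega_2\})$ 
for $\omega_1 \neq \omega_2$. This means that the probability of the event that $\omega_1$ is measured or $\omega_2$ is 
measured is the sum of $\tilde P(\{\omega_1\})$ and $\tilde P(\{\omega_2\})$. This probability space excludes the case 
$\{\omega \in \Omega \mid \langle\omega|\psi\rangle = 0\}$. We shall deal with that case at the end of Sect 3. For 
notational simplicity, hereafter we represent the set $\{\omega\} \in \tilde{\mathcal{F}}$ as $\omega$ so that 
$\tilde P(\omega) := \tilde P(\{\omega\})$. For a fixed choice of $V_{\max}$, the field $\tilde{\mathcal{F}}$ is also 
fixed provided that we specify the prepared state $|\psi\rangle$. 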

In what follows, we use the standard notation that $\mathbb{R}$ and $\mathbb{C}$ are the fields of real and complex 
numbers, respectively, and $\sigma_x, \sigma_y, \sigma_z$ are the Pauli matrices. 

III. THE MAIN THEOREM 

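We denote the contextual value of $A \in \mathcal{N}$ in $\omega \in \tilde\Omega$ by $\lambda_\omega(A)$ and define it 
as the nonzero map $\lambda_\omega : \mathcal{N} \to \mathbb{C}$ which satisfies the following conditions. 

(i) Sum Rule: 

$$\lambda_\omega(A + B) = \lambda_\omega(A) + \lambda_\omega(B), \quad \forall A, B \in \mathcal{N}.$$

(ii) Product Rule: 

$$\lambda_\omega(TS) = \lambda_\omega(T)\,\lambda_\omega(S), \quad \forall T, S \in V_{\max}.$$

(iii) Initial condition: 

$$\lambda_\omega(|\psi\rangle\langle\psi|) = 1, \quad \lambda_\omega(|\psi^\perp\rangle\langle\psi^\perp|) = 0,$$

where $|\psi\rangle$ is the prepared (pre-selected) state and $|\psi^\perp\rangle$ is an arbitrary state orthogonal to 
$|\psi\rangle$. 

(iv) Invariance: The expectation value and the variance of an observable $A$ defined by 

$$\mathrm{Ex}(A) := \sum_{\omega\in\tilde\Omega} \tilde P(\omega)\,\lambda_\omega(A), \tag{1}$$

$$\mathrm{Var}(A) := \sum_{\omega\in\tilde\Omega} \tilde P(\omega)\,|\lambda_\omega(A) - \mathrm{Ex}(A)|^2 \tag{2}$$

are independent of the choice of $V_{\max}$ and depend only on the prepared state $|\psi\rangle$ and $A$. 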

Some remarks on these conditions are in order. The sum rule (i) excludes the possibility that the values 
$\lambda_\omega(A)$ and $\lambda_\omega(B)$ are eigenvalues of the observables $A$ and $B$ if they do not commute. A 
simple example of this is $\sigma_x$ and $\sigma_y$, each with the eigenvalues $\pm 1$, while the sum 
$\sigma_x + \sigma_y$ has eigenvalues $\pm\sqrt{2}$. Historically, the rule (i) was introduced by von Neumann [10] when 
he made an argument to refute the hidden variable theory. The product rule (ii) means that the map $\lambda_\omega$ is 
a character on $V_{\max}$ and implies that it can be seen as a 'valuation' on $V_{\max}$ [?]. This means that the 
contextual value for an observable $T \in V_{\max}$ can be identified with a classical value of $T$. We are going to 
generalize it to a contextual value of an observable $A \notin V_{\max}$, which has no counterpart in classical theory. 
We see that $\lambda_\omega(1) = 1$ from (ii). The requirement (iii) is just an assumption based on the intuition that 
the value of the projection operator of the initial state is unity and the values of the projection operators of the 
states $|\psi^\perp\rangle$ which are orthogonal to the prepared state $|\psi\rangle$ are zero if we know definitely 
what the initial state is. The invariance of the expectation value and the variance of an observable $A$ in the 
requirement (iv) uniquely determines the normal distribution function of the observable obtained after repeating many 
experiments, according to the central limit theorem. 
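
The Pauli-matrix remark can be checked numerically. The following is a minimal sketch in plain Python; the helper names `add` and `eigenvalues_hermitian_2x2` are ours and purely illustrative. It computes the eigenvalues of a 2x2 Hermitian matrix from its trace and determinant and confirms that the eigenvalues of $\sigma_x + \sigma_y$ are not sums of the eigenvalues of $\sigma_x$ and $\sigma_y$.

```python
import math

# Pauli matrices as nested lists of (complex) numbers.
SIGMA_X = [[0, 1], [1, 0]]
SIGMA_Y = [[0, -1j], [1j, 0]]


def add(m1, m2):
    """Entrywise sum of two 2x2 matrices."""
    return [[m1[i][j] + m2[i][j] for j in range(2)] for i in range(2)]


def eigenvalues_hermitian_2x2(m):
    """Eigenvalues (ascending) of a 2x2 Hermitian matrix via trace and determinant."""
    tr = (m[0][0] + m[1][1]).real
    det = (m[0][0] * m[1][1] - m[0][1] * m[1][0]).real
    disc = math.sqrt(tr * tr / 4 - det)
    return (tr / 2 - disc, tr / 2 + disc)


eig_x = eigenvalues_hermitian_2x2(SIGMA_X)
eig_y = eigenvalues_hermitian_2x2(SIGMA_Y)
eig_sum = eigenvalues_hermitian_2x2(add(SIGMA_X, SIGMA_Y))

# All possible sums of one eigenvalue of sigma_x and one of sigma_y.
pairwise_sums = {x + y for x in eig_x for y in eig_y}
```

Here `pairwise_sums` contains only the values $-2$, $0$ and $2$, none of which equals $\pm\sqrt{2}$, so a value assignment obeying the sum rule cannot pick eigenvalues for both $\sigma_x$ and $\sigma_y$.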

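From the sum rule (i) we can apply Riesz's theorem and, using the normalization $\lambda_\omega(1) = 1$, we can write 
the contextual value of $A \in \mathcal{N}$ as 

$$\lambda_\omega(A) = \frac{\mathrm{Tr}(W_\omega A)}{\mathrm{Tr}\,W_\omega}, \tag{3}$$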
with $W_\omega$ being some operator. The product rule (ii) for the maximal abelian subalgebra $V_{\max}$ narrows down 
the expression for the operator $W_\omega$ to 

$$W_\omega = a|\alpha\rangle\langle\omega| + b|\omega\rangle\langle\beta| + W_\omega^\perp, \quad a, b \in \mathbb{C}, \tag{4}$$

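for some $|\alpha\rangle \in \mathcal{H}$ and $\langle\beta| \in \mathcal{H}^*$, where $\mathcal{H}^*$ is the dual space 
of $\mathcal{H}$. Here $W_\omega^\perp$ is an operator which satisfies 
$\langle\omega'|W_\omega^\perp|\omega'\rangle = 0$ for all $\omega' \in \Omega$. The requirement (iii) implies that 
$W_\omega$ has a form $W_\omega = |\psi\rangle\langle q| + |r\rangle\langle\psi|$ for some $\langle q|$ and $|r\rangle$. 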

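Equating this with (4), we find that $|\alpha\rangle = |\psi\rangle$, $\langle\beta| = \langle\psi|$ and 
$W_\omega^\perp = 0$, so that 

$$W_\omega = a|\psi\rangle\langle\omega| + b|\omega\rangle\langle\psi|, \quad a, b \in \mathbb{C}, \tag{5}$$

and the contextual value becomes 

$$\lambda_\omega(A) = \frac{a\langle\omega|A|\psi\rangle + b\langle\psi|A|\omega\rangle}{a\langle\omega|\psi\rangle + b\langle\psi|\omega\rangle}. \tag{6}$$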
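
As a short check, the ratio in (6) follows from inserting the operator (5) into a trace representation of the contextual value. The sketch below assumes that (3) has the Riesz form $\lambda_\omega(A) = \mathrm{Tr}(W_\omega A)/\mathrm{Tr}\,W_\omega$; this is an assumption compatible with the normalization $\lambda_\omega(1) = 1$, not a formula quoted from elsewhere.

```latex
\begin{align*}
\mathrm{Tr}(W_\omega A)
  &= a\,\mathrm{Tr}\big(|\psi\rangle\langle\omega| A\big)
   + b\,\mathrm{Tr}\big(|\omega\rangle\langle\psi| A\big)
   = a\langle\omega|A|\psi\rangle + b\langle\psi|A|\omega\rangle, \\
\mathrm{Tr}\,W_\omega
  &= a\langle\omega|\psi\rangle + b\langle\psi|\omega\rangle, \\
\lambda_\omega(A)
  &= \frac{\mathrm{Tr}(W_\omega A)}{\mathrm{Tr}\,W_\omega}
   = \frac{a\langle\omega|A|\psi\rangle + b\langle\psi|A|\omega\rangle}
          {a\langle\omega|\psi\rangle + b\langle\psi|\omega\rangle}.
\end{align*}
```

For $b = 0$ the constant $a$ cancels and the ratio reduces to $\langle\omega|A|\psi\rangle / \langle\omega|\psi\rangle$.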

So far our discussion for the value of a physical quantity has not touched upon probabilistic interpretation. At this 
stage we formally introduce the concept of probability following the standard probability theory of Kolmogorov [?]. 
We use the formula for the expectation value of an observable $A \in \mathcal{N}$, 

$$\mathrm{Ex}(A) = \sum_{\omega\in\tilde\Omega} \tilde P(\omega)\,\lambda_\omega(A), \tag{7}$$

where the contextual value $\lambda_\omega(A)$ of $A$ in $\omega \in \tilde\Omega$ is given by (6). The expectation 
value $\mathrm{Ex}(A)$ is an average of the contextual value $\lambda_\omega(A)$ over the context of measurement 
$\{|\omega\rangle\}_{\omega\in\Omega}$. The value $\lambda_\omega(A)$ corresponds to the random variable for an 
observable $A$ in probability theory. Each $\lambda_\omega(A)$ depends on $|\omega\rangle$ and therefore on how we 
choose the context $V_{\max}$, but the averaging washes out the dependence on the choice of the context, i.e., how we 
choose the set of the states to be post-selected. We shall show that the condition (iv) leads to a further 
specification of $W_\omega$, i.e., $b = 0$ in (5), so that 

$$W_\omega = |\psi\rangle\langle\omega|. \tag{8}$$

We now state and prove our main result. 
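Theorem. A map $\lambda_\omega : \mathcal{N} \to \mathbb{C}$ satisfying the above conditions (i)-(iv) is of the form 

$$\lambda_\omega(A) = \frac{\langle\omega|A|\psi\rangle}{\langle\omega|\psi\rangle}, \quad \text{for } \omega \in \tilde\Omega, \tag{9}$$

and the probability measure is 

$$\tilde P(\omega) = |\langle\omega|\psi\rangle|^2. \tag{10}$$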



The contextual value (9) is identical to the weak value of Aharonov with $|\psi\rangle$ and $\langle\omega|$ being the 
pre-selected state and the post-selected state, respectively. By this identification, we explicitly see that the 
context can be interpreted as a set of post-selected states. Note that the probability measure is a matter of choice in 
classical probability theory, while Born's rule (10) is mandatory in quantum mechanics, and even its non-negativity is 
an important aspect of the consequence. The asymmetry of $\langle\omega|$ and $|\psi\rangle$ in (9) comes from the 
expression for the expectation value (7), in which the post-selected states are summed up on the basis of an intuition 
that we can choose a pre-selected state but its outcome is uncertain. The arrow of time is built into (7). The 
expression (7) for the expectation value with (9) and (10) reproduces the standard quantum mechanical expectation value, 

$$\mathrm{Ex}(A) = \langle\psi|A|\psi\rangle, \tag{11}$$

which certainly does not depend on the context $\{|\omega\rangle\}_{\omega\in\Omega}$. The apparent context dependence 
of $\mathrm{Ex}(A)$ in (7) goes away by the completeness relation: 
$\sum_{\omega\in\Omega} \langle\psi|\omega\rangle\langle\omega|A|\psi\rangle = \langle\psi|A|\psi\rangle$. It is 
instructive to think about the possibility $\tilde P(\omega) = |\langle\omega|\psi\rangle|^4$ instead of (10). Then we 
would have 
$\mathrm{Ex}(A) = \sum_{\omega\in\tilde\Omega} |\langle\omega|\psi\rangle|^4 \frac{\langle\omega|A|\psi\rangle}{\langle\omega|\psi\rangle} = \sum_{\omega\in\tilde\Omega} |\langle\omega|\psi\rangle|^2 \langle\psi|\omega\rangle\langle\omega|A|\psi\rangle$. 
The context dependence would not go away. We can follow the exact parallel way also for the variance $\mathrm{Var}(A)$ 
to confirm its context independence. 
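
The contrast between the quadratic weight and the hypothetical quartic weight can be illustrated numerically. The sketch below is a toy check in plain Python for a two-dimensional real example; the state, the observable, the two contexts and all function names are illustrative assumptions. It evaluates the sum in (7) with the weak value (9) and the weight $|\langle\omega|\psi\rangle|^n$ for $n = 2$ and $n = 4$.

```python
import math


def inner(u, v):
    """<u|v> for vectors given as lists of numbers."""
    return sum(a.conjugate() * b for a, b in zip(u, v))


def apply(m, v):
    """Matrix m acting on vector v."""
    return [sum(m[i][j] * v[j] for j in range(len(v))) for i in range(len(m))]


def weak_value(omega, a, psi):
    """<omega|A|psi> / <omega|psi>, requires <omega|psi> != 0."""
    return inner(omega, apply(a, psi)) / inner(omega, psi)


def weighted_sum(context, a, psi, power):
    """Sum over the context of |<omega|psi>|^power times the weak value."""
    return sum(abs(inner(w, psi)) ** power * weak_value(w, a, psi) for w in context)


def rotated_basis(phi):
    """Orthonormal basis of R^2 rotated by the angle phi (one choice of context)."""
    return [[math.cos(phi), math.sin(phi)], [-math.sin(phi), math.cos(phi)]]


psi = [math.cos(0.3), math.sin(0.3)]   # prepared state (illustrative)
a = [[1.0, 0.5], [0.5, -1.0]]          # Hermitian observable (illustrative)
context_1 = rotated_basis(0.0)
context_2 = rotated_basis(0.7)

standard = inner(psi, apply(a, psi))   # <psi|A|psi>
born_1 = weighted_sum(context_1, a, psi, 2)
born_2 = weighted_sum(context_2, a, psi, 2)
quartic_1 = weighted_sum(context_1, a, psi, 4)
quartic_2 = weighted_sum(context_2, a, psi, 4)
```

With the quadratic weight both contexts return `standard`, while the quartic sums `quartic_1` and `quartic_2` differ from each other. The quartic weight is also not normalized; the sketch only probes the context dependence.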






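Proof. Consider a functional 

$$\mathcal{L}_E(\langle\omega|, \mu) := \mathrm{Ex}(A) - \sum_{i=0}^{N-1} \mu_i \Big[ \sum_{\omega\in\Omega} \langle\psi_i|\omega\rangle\langle\omega|A|\psi\rangle - \langle\psi_i|A|\psi\rangle \Big], \quad \mu_i \in \mathbb{R}, \tag{12}$$

where $\mu_i$ are the Lagrange multipliers and $\{|\psi_i\rangle\}$ is a complete orthonormal system which contains 
$|\psi_0\rangle = |\psi\rangle$ as a unit vector. The Lagrange multiplier terms give the constraints 

$$\sum_{\omega\in\Omega} \langle\psi_i|\omega\rangle\langle\omega|A|\psi\rangle - \langle\psi_i|A|\psi\rangle = 0, \quad \forall i, \ \forall A \in \mathcal{N}. \tag{13}$$

Since $\{|\psi_i\rangle\}$ is a complete orthonormal system and $A \in \mathcal{N}$ is arbitrary, the above constraints 
are equivalent to the completeness relation $\sum_{\omega\in\Omega} |\omega\rangle\langle\omega| = 1$ for 
$\{|\omega\rangle\}_{\omega\in\Omega}$, as we see by comparing its matrix elements with (13). 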

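We demand that the variation of $\mathcal{L}_E(\langle\omega|, \mu)$ with respect to $\langle\omega|$ should vanish. 
That is, 

$$\frac{\delta\mathcal{L}_E(\langle\omega|, \mu)}{\delta\langle\omega|} = \frac{\partial\tilde P(\omega)}{\partial\langle\omega|}\,\frac{a\langle\omega|A|\psi\rangle + b\langle\psi|A|\omega\rangle}{a\langle\omega|\psi\rangle + b\langle\psi|\omega\rangle} + \tilde P(\omega)\,\frac{a A|\psi\rangle}{a\langle\omega|\psi\rangle + b\langle\psi|\omega\rangle} - \tilde P(\omega)\,\frac{a\langle\omega|A|\psi\rangle + b\langle\psi|A|\omega\rangle}{(a\langle\omega|\psi\rangle + b\langle\psi|\omega\rangle)^2}\,a|\psi\rangle - \sum_i \mu_i \langle\psi_i|\omega\rangle A|\psi\rangle = 0, \quad \forall A \in \mathcal{N}.$$

A particular choice $A = |\psi\rangle\langle\psi|$ implies that 
$\partial\tilde P(\omega)/\partial\langle\omega| \propto |\psi\rangle$, since the second, third and fourth terms are 
then $\propto |\psi\rangle$. To see the implication of the component of the above equation perpendicular to 
$|\psi\rangle$, consider the case $A = |\psi^\perp\rangle\langle\psi| + |\psi\rangle\langle\psi^\perp|$, where 
$|\psi^\perp\rangle$ is a state orthogonal to $|\psi\rangle$, to find that the second and the fourth terms cancel 
each other. We therefore see that the sum of the first and the third terms and the sum of the second and the fourth 
terms separately vanish, i.e., 

$$\frac{\partial\tilde P(\omega)}{\partial\langle\omega|} - \frac{a\tilde P(\omega)}{a\langle\omega|\psi\rangle + b\langle\psi|\omega\rangle}\,|\psi\rangle = 0, \tag{14}$$

$$\frac{a\tilde P(\omega)}{a\langle\omega|\psi\rangle + b\langle\psi|\omega\rangle}\,|\psi\rangle - \sum_i \mu_i \langle\psi_i|\omega\rangle |\psi\rangle = 0. \tag{15}$$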



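Adding these two equations yields 

$$\frac{\partial\tilde P(\omega)}{\partial\langle\omega|} = \sum_i \mu_i \langle\psi_i|\omega\rangle |\psi\rangle, \tag{16}$$

and its integration gives $\tilde P(\omega) = \sum_i \mu_i \langle\psi_i|\omega\rangle\langle\omega|\psi\rangle + P_0$, 
where $P_0$ is an integration constant. Therefore, (15) becomes, after dropping the common factor $|\psi\rangle$, 

$$\frac{a\big(\sum_i \mu_i \langle\psi_i|\omega\rangle\langle\omega|\psi\rangle + P_0\big)}{a\langle\omega|\psi\rangle + b\langle\psi|\omega\rangle} - \sum_i \mu_i \langle\psi_i|\omega\rangle = 0, \quad \forall\omega \in \tilde\Omega. \tag{17}$$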

This implies that $P_0 = 0$ and $b = 0$. We note that $\delta\mathcal{L}_E/\delta|\omega\rangle = 0$ is then automatically satisfied. Therefore, the context 
invariance of the expectation value Ex(A) leads to expressions for the probability measure and the contextual value 
of an observable A as 



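$$\tilde P(\omega) = \sum_i \mu_i \langle\psi_i|\omega\rangle\langle\omega|\psi\rangle, \tag{18}$$

$$\lambda_\omega(A) = \frac{\langle\omega|A|\psi\rangle}{\langle\omega|\psi\rangle}. \tag{19}$$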



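We further demand that the variance 
$\mathrm{Var}(A) = \sum_{\omega\in\tilde\Omega} \tilde P(\omega)\,|\lambda_\omega(A)|^2$ is context independent. To 
simplify the discussion we set $\mathrm{Ex}(A) = 0$ or replace $A$ by $A - \mathrm{Ex}(A)$. It is possible to set up a 
variational problem for $\mathrm{Var}(A)$ similar to the previous one for $\mathrm{Ex}(A)$. Instead we simply insert 
the results (18) and (19) into the expression for $\mathrm{Var}(A)$ to obtain 

$$\mathrm{Var}(A) = \sum_{\omega\in\tilde\Omega} \sum_i \mu_i \langle\psi_i|\omega\rangle\langle\omega|\psi\rangle \left| \frac{\langle\omega|A|\psi\rangle}{\langle\omega|\psi\rangle} \right|^2 = \sum_{\omega\in\tilde\Omega} \sum_i \mu_i \frac{\langle\psi_i|\omega\rangle}{\langle\psi|\omega\rangle} \langle\psi|A|\omega\rangle\langle\omega|A|\psi\rangle. \tag{20}$$

This is independent of the context $\{|\omega\rangle\}_{\omega\in\Omega}$ if and only if $\mu_i = 0$ for $i \neq 0$. 
The remaining parameter $\mu_0$ is fixed to be unity by the normalization condition 
$\sum_{\omega\in\tilde\Omega} \tilde P(\omega) = \mu_0 = 1$. Then we arrive at the main theorem (9) and (10). ■ 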
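
The role of the multipliers $\mu_i$ in the variance can be illustrated with a small numerical sketch. It assumes a two-dimensional real example with $|\psi_0\rangle = |\psi\rangle$ and $|\psi_1\rangle = |\psi^\perp\rangle$; the state, observable, contexts and function names are illustrative choices of ours. The code evaluates $\sum_\omega \tilde P(\omega)|\lambda_\omega(A)|^2$ with $\tilde P(\omega) = \sum_i \mu_i \langle\psi_i|\omega\rangle\langle\omega|\psi\rangle$ after shifting $A$ so that its expectation value vanishes.

```python
import math


def inner(u, v):
    """<u|v> for vectors given as lists of numbers."""
    return sum(a.conjugate() * b for a, b in zip(u, v))


def apply(m, v):
    """Matrix m acting on vector v."""
    return [sum(m[i][j] * v[j] for j in range(len(v))) for i in range(len(m))]


def second_moment(context, a, psi, psi_perp, mu0, mu1):
    """Sum over the context of P(omega) |lambda_omega(A)|^2 with the two-parameter P."""
    a_psi = apply(a, psi)
    total = 0.0
    for w in context:
        p = (mu0 * inner(psi, w) + mu1 * inner(psi_perp, w)) * inner(w, psi)
        lam = inner(w, a_psi) / inner(w, psi)
        total += p * abs(lam) ** 2
    return total


def rotated_basis(phi):
    """Orthonormal basis of R^2 rotated by the angle phi (one choice of context)."""
    return [[math.cos(phi), math.sin(phi)], [-math.sin(phi), math.cos(phi)]]


psi = [math.cos(0.3), math.sin(0.3)]
psi_perp = [-math.sin(0.3), math.cos(0.3)]
a = [[1.0, 0.5], [0.5, -1.0]]

# Shift A so that <psi|A|psi> = 0, as in the simplification used in the proof.
ex = inner(psi, apply(a, psi))
a0 = [[a[i][j] - (ex if i == j else 0.0) for j in range(2)] for i in range(2)]

context_1 = rotated_basis(0.0)
context_2 = rotated_basis(0.7)

born_1 = second_moment(context_1, a0, psi, psi_perp, 1.0, 0.0)
born_2 = second_moment(context_2, a0, psi, psi_perp, 1.0, 0.0)
mixed_1 = second_moment(context_1, a0, psi, psi_perp, 1.0, 0.5)
mixed_2 = second_moment(context_2, a0, psi, psi_perp, 1.0, 0.5)
exact_variance = inner(apply(a0, psi), apply(a0, psi))  # <psi|(A - Ex)^2|psi>
```

For $\mu_1 = 0$ both contexts give `exact_variance`, whereas the illustrative choice $\mu_1 = 0.5$ produces different values in the two contexts.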

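Now we can naturally extend the probability space $(\tilde\Omega, \tilde{\mathcal{F}}, \tilde P)$ to 
$(\Omega, \mathcal{F}, P)$ as follows. The sample space is $\Omega$, and the $\sigma$-field $\mathcal{F}$ over $\Omega$ 
is the power set of $\Omega$. $P$ is the probability measure over $\mathcal{F}$ satisfying the following conditions: 
$P|_{\tilde{\mathcal{F}}} = \tilde P$, $P|_{\mathcal{F}\setminus\tilde{\mathcal{F}}} = 0$. From the theorem, we can 
write $P(\omega) = |\langle\omega|\psi\rangle|^2$. Thus the probability space $(\Omega, \mathcal{F}, P)$ describes the 
standard quantum mechanical statistics in the context $\{|\omega\rangle\}_{\omega\in\Omega}$. 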
IV. SUMMARY AND DISCUSSIONS 



We have shown that the contextual value of an observable is the weak value by demanding the consistency of 
quantum mechanics with Kolmogorov's measure theory of probability in conjunction with the consideration of the 
values of observables. The crucial requirement was that the expectation value and the variance of an observable 
should be independent of the way of measurement. This leads eventually to Born's rule in quantum mechanics. The 
assumptions of the sum rule (i) and the product rule (ii) are hardly problematic, and the invariance (iv) of the 
expectation value and the variance is crucial to relate the contextual value to the concept of probability. The third 
assumption (iii) is debatable. At the moment we do not have a complete justification for it, but we believe that (iii) 
is useful for characterizing the contextual value. 

Going back to the original motivation in the introduction, we would like to examine in what sense the weak value 
can be regarded as the value before projective measurements. Let $A(t)$ be an operator at time $t < T$ in the 
Heisenberg representation, where $T$ is the time of the projective measurement. We choose $A(T)$ as an element of 
$V_{\max}$ so that the post-selected state $\langle a|$ is an eigenstate of $A(T)$ with the eigenvalue $a$, 
$\langle a|A(T) = a\langle a|$. The weak value 

$$\lambda_a(A(t)) = \frac{\langle a|A(t)|\psi\rangle}{\langle a|\psi\rangle} \tag{21}$$

can be interpreted as the value before projective measurement provided that $\langle a|$ is not orthogonal to the 
pre-selected state $|\psi\rangle$. At $t = T$, the weak value (21) coincides with the eigenvalue $a$. As stressed 
before, the weak value depends on the context, i.e., the states to be post-selected. The price is that the contextual 
value is complex in general. 
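
The statement that the weak value reduces to the eigenvalue when the post-selected state is an eigenstate of the observable can be checked directly. The following minimal sketch in plain Python treats only the instant $t = T$ (no Heisenberg evolution is modelled) and uses an illustrative real symmetric 2x2 observable and illustrative, unnormalized pre-selected states; all names are hypothetical.

```python
import math


def inner(u, v):
    """<u|v> for vectors given as lists of numbers."""
    return sum(a.conjugate() * b for a, b in zip(u, v))


def apply(m, v):
    """Matrix m acting on vector v."""
    return [sum(m[i][j] * v[j] for j in range(len(v))) for i in range(len(m))]


def weak_value(post, a, pre):
    """<post|A|pre> / <post|pre>, requires <post|pre> != 0."""
    return inner(post, apply(a, pre)) / inner(post, pre)


# Observable at the measurement time T (illustrative choice).
A_T = [[1.0, 0.5], [0.5, -1.0]]

# Its eigenvalues are +-sqrt(1.25); [0.5, eigenvalue - 1] is an (unnormalized)
# eigenvector for the positive one. Normalization cancels in the ratio.
eigenvalue = math.sqrt(1.25)
post_selected = [0.5, eigenvalue - 1.0]

# Several pre-selected states, none orthogonal to the post-selected state.
pre_states = [[1.0, 0.0], [0.6, 0.8j], [0.3, -0.9]]
values = [weak_value(post_selected, A_T, psi) for psi in pre_states]
```

Every entry of `values` equals `eigenvalue` up to rounding, independently of the pre-selected state.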

The extension of the derivation of Born's rule to the mixed state case is straightforward but technically involved so 
that we postpone it for future publication. 

From the main theorem we see that the value of an observable is given by the weak value (9). As is by now well 
known, the weak value is experimentally accessible [12]-[21] and has been analyzed theoretically by many people 
[23]-[25]. The weak value coincides with one of the eigenvalues when the observable is restricted to the maximal 
abelian subalgebra $V_{\max} \in V(\mathcal{N})_{\max}$. For a general $A \in \mathcal{N}$, the weak value can be 
deduced from the information associated with the state to be post-selected, which is circumstantially inferred from 
the set of eigenvalues to be obtained by projective measurements of all the elements of $V_{\max}$. 

The Kochen-Specker theorem [26] tells us that it is not possible to assign non-contextual values to all observables 
in quantum mechanics. Since the only non-contextual values are eigenvalues of the observables, this theorem does not 
apply to the weak values. Actually the weak value is explicitly dependent on the context defined by the post-selected 
states. 



In the proof of Gleason's theorem [27], he demanded that the measure over the set of all the subspaces of the Hilbert 
space $\mathcal{H}$ should be independent of the choice of basis in order to conclude that the measure has a quadratic 
form in the state vector components if $\dim(\mathcal{H}) \geq 3$. This leads to the standard expression for the 
expectation value. There is a gap between the probability and the measure over the set of all the subspaces of the 
Hilbert space $\mathcal{H}$, so that the derived Born's rule lacks the probabilistic interpretation. In a mathematical 
sense, Gleason's theorem is close to ours. We believe that we have filled the gap by introducing the physical concept 
of the contextual value of observables defined by the prepared state and the states to be post-selected. A rather 
surprising finding in the present work is that the contextual value coincides with the weak value. 

Zurek [28] claimed that he derived Born's rule from "the environment assisted invariance". He considered an 
entangled state 

$$|\Psi\rangle = \frac{1}{\sqrt{2}} \big( |s_1\rangle|e_1\rangle + |s_2\rangle|e_2\rangle \big), \tag{22}$$

where $|s_1\rangle$ and $|s_2\rangle$ are an orthonormal basis of the system, while $|e_1\rangle$ and $|e_2\rangle$ are 
an orthonormal basis of the environment. Consider an observable $A \otimes 1$ with 
$A = |s_1\rangle\langle s_1| - |s_2\rangle\langle s_2|$. The elementary events are 
$x_1 := \{\langle s_1|\langle e_1|\}$ and $x_2 := \{\langle s_2|\langle e_2|\}$. The sample space is 
$\Omega' := \{x_1, x_2\}$, and the $\sigma$-field is the power set of $\Omega'$. We would like to show that the 
probabilities to obtain $x_1$ and $x_2$ are 1/2. To see this we compute the weak values: $\lambda_{x_1}(A) = 1$, 
$\lambda_{x_2}(A) = -1$. We apply the formula (7), noting that the left hand side is zero because of the symmetry under 
the SWAP operation of the states 1 and 2 both for the system and the environment. The operator $A \otimes 1$ does not 
distinguish the states of the environment, so that the expectation value has to be symmetric under the SWAP of the 
system states. Then we see that $P(x_1) - P(x_2) = 0$ on the right hand side and arrive at the equal probability 
$P(x_1) = P(x_2) = 1/2$, using $\mathrm{Ex}(1) = P(x_1) + P(x_2) = 1$. In this simple case, the discrete symmetry alone 
is sufficient to derive Born's rule. Here we see that the symmetry principle should apply to the expectation value to 
obtain Born's rule. The role of entanglement with the environment is not clear to us, though we suspect a possible 
connection to the post-selection. 
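
The two-outcome argument can be traced step by step in a small numerical sketch. It assumes that the entangled state is the equal-weight superposition $(|s_1\rangle|e_1\rangle + |s_2\rangle|e_2\rangle)/\sqrt{2}$ and uses a basis ordering and variable names chosen only for illustration. The code computes the two weak values and then solves the pair of linear conditions $P(x_1)\lambda_{x_1} + P(x_2)\lambda_{x_2} = 0$ and $P(x_1) + P(x_2) = 1$.

```python
import math


def inner(u, v):
    """<u|v> for vectors given as lists of numbers."""
    return sum(a.conjugate() * b for a, b in zip(u, v))


def apply(m, v):
    """Matrix m acting on vector v."""
    return [sum(m[i][j] * v[j] for j in range(len(v))) for i in range(len(m))]


def weak_value(post, a, pre):
    """<post|A|pre> / <post|pre>, requires <post|pre> != 0."""
    return inner(post, apply(a, pre)) / inner(post, pre)


# Basis ordering (an assumption of this sketch): |s1 e1>, |s1 e2>, |s2 e1>, |s2 e2>.
# Assumed pre-selected state: (|s1>|e1> + |s2>|e2>) / sqrt(2).
PSI = [1 / math.sqrt(2), 0.0, 0.0, 1 / math.sqrt(2)]

# A (x) 1 with A = |s1><s1| - |s2><s2| is diagonal in this basis.
A_TENSOR_ID = [
    [1.0, 0.0, 0.0, 0.0],
    [0.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, -1.0, 0.0],
    [0.0, 0.0, 0.0, -1.0],
]

X1 = [1.0, 0.0, 0.0, 0.0]  # post-selected state <s1|<e1|
X2 = [0.0, 0.0, 0.0, 1.0]  # post-selected state <s2|<e2|

lam_1 = weak_value(X1, A_TENSOR_ID, PSI)
lam_2 = weak_value(X2, A_TENSOR_ID, PSI)

# Symmetry forces the expectation value to vanish: p1*lam_1 + p2*lam_2 = 0.
# Together with p1 + p2 = 1 this gives p1 by elimination.
p1 = -lam_2 / (lam_1 - lam_2)
p2 = 1.0 - p1
```

The weak values come out as $+1$ and $-1$, and the two linear conditions fix `p1` and `p2` to one half each, in line with the argument in the text.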